🧠 Philosophy of Mind - vhpoet · Scour

Evidence on language model consciousness

lesswrong.com·15h

🪞Metacognition

We are building AI slaves. Alignment through control will fail

utopai.substack.com·1d

Discuss: Substack

Anthropic Research Shows How LLMs Perceive Text via @sejournal, @martinibuster

searchenginejournal.com·2d

Emergent Introspective Awareness in Large Language Models

transformer-circuits.pub·2d

Discuss: Hacker News, r/LocalLLaMA

🪞Metacognition

Take Weird Ideas Seriously

notboring.co·2d

Discuss: Hacker News

GenAI Poisoning: How Fewer Than 100 Samples Can Corrupt a Multi-Billion Parameter Model

pub.towardsai.net·1d

LLM-generated text is not testimony

lesswrong.com·4h

How I Learned to Stop Worrying and Love My Shitty Life

thedriftmag.com·1d

Discuss: Hacker News

Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems

lesswrong.com·1d

Show HN: Why write code if the LLM can just do the thing? (web app experiment)

github.com·1h

Discuss: Hacker News

“Gender without Children”

marginalrevolution.com·14h

xkcd.com·1d

🏛️Philosophy

Carlo Rovelli’s Radical Perspective on Reality

quantamagazine.org·3d

Discuss: Hacker News

🏛️Philosophy

Asking Paul Fussell for Writing Advice

lesswrong.com·15h

🏛️Philosophy

Signs of introspection in large language models

anthropic.com·3d

Discuss: Hacker News, r/ClaudeAI

🪞Metacognition

Dating: A mysterious constellation of facts

dynomight.net·2d

Discuss: Hacker News

OpenAI updates terms to forbid usage for medical and legal advice

openai.com·20h

Discuss: Hacker News

Reasoning Models Reason Well, Until They Don't

arxiv.org·4d

Discuss: Hacker News

Freewriting in my head, and overcoming the “twinge of starting”

lesswrong.com·18h

🪞Metacognition

Debugging Despair ~> A bet about Satisfaction and Values

lesswrong.com·1d
